On the Detection of Concept Changes in Time-Varying Data Stream by Testing Exchangeability

نویسندگان

  • Shen-Shyang Ho
  • Harry Wechsler
چکیده

Introduction In a data streaming setting, data points are observed one by one. The concepts to be learned from the data points may change infinitely often as the data is streaming. We extend the idea of testing exchangeability online [2] to a martingale framework to detect concept changes in time-varying data streams [P1]. •Two martingale tests using: (i) martingale values (MT1) and (ii) the martingale difference (MT2) [P1] are proposed. •MT1 is shown to be an approximation of the sequential probability ratio test (SPRT). The relationship between the threshold value used and its size and power is deduced. The mean delay time before a change is detected is estimated. •Under some assumptions, MT2 theoretically has a lower probability than MT1 of rejecting the null hypothesis, “no concept change in the data stream”, when it is in fact correct [P1].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...

متن کامل

Concept drift detection in business process logs using deep learning

Process mining provides a bridge between process modeling and analysis on the one hand and data mining on the other hand. Process mining aims at discovering, monitoring, and improving real processes by extracting knowledge from event logs. However, as most business processes change over time (e.g. the effects of new legislation, seasonal effects and etc.), traditional process mining techniques ...

متن کامل

F-STONE: A Fast Real-Time DDOS Attack Detection Method Using an Improved Historical Memory Management

Distributed Denial of Service (DDoS) is a common attack in recent years that can deplete the bandwidth of victim nodes by flooding packets. Based on the type and quantity of traffic used for the attack and the exploited vulnerability of the target, DDoS attacks are grouped into three categories as Volumetric attacks, Protocol attacks and Application attacks. The volumetric attack, which the pro...

متن کامل

MODELING OF GROUNDWATER FLOW OVER SLOPING BEDS IN RESPONSE TO CONSTANT RECHARGE AND STREAM OF VARYING WATER LEVEL

This paper presents an analytical model characterizing unsteady groundwater flow in an unconfined aquifer resting on a sloping impervious bed. The aquifer is in contact with a constant water level at one end. The other end is connected to a stream whose level is increasing form an initial level to a final level at a known exponentially decaying function of time. Moreover, the aquifer is repleni...

متن کامل

Concept drift detection in event logs using statistical information of variants

In recent years, business process management (BPM) has been highly regarded as an improvement in the efficiency and effectiveness of organizations. Extracting and analyzing information on business processes is an important part of this structure. But these processes are not sustainable over time and may change for a variety of reasons, such as the environment and human resources. These changes ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005